Encoding Contextual Information by Interlacing Transformer and Convolution for Remote Sensing Imagery Semantic Segmentation
نویسندگان
چکیده
Contextual information plays a pivotal role in the semantic segmentation of remote sensing imagery (RSI) due to imbalanced distributions and ubiquitous intra-class variants. The emergence transformer intrigues revolution vision tasks with its impressive scalability establishing long-range dependencies. However, local patterns, such as inherent structures spatial details, are broken tokenization transformer. Therefore, ICTNet is devised confront deficiencies mentioned above. Principally, inherits encoder–decoder architecture. First all, Swin Transformer blocks (STBs) convolution (CBs) deployed interlaced, accompanied by encoded feature aggregation modules (EFAs) encoder stage. This design allows network learn patterns distant dependencies their interactions simultaneously. Moreover, multiple DUpsamplings (DUPs) followed decoded (DFAs) form decoder ICTNet. Specifically, transformation upsampling loss shrunken while recovering features. Together decoder, well-rounded context captured contributes inference most. Extensive experiments conducted on ISPRS Vaihingen, Potsdam DeepGlobe benchmarks. Quantitative qualitative evaluations exhibit competitive performance compared mainstream state-of-the-art methods. Additionally, ablation study DFA DUP implemented validate effects.
منابع مشابه
EarthMapper: A Tool Box for the Semantic Segmentation of Remote Sensing Imagery
Deep learning continues to push state-of-the-art performance for the semantic segmentation of color (i.e., RGB) imagery; however, the lack of annotated data for many remote sensing sensors (i.e. hyperspectral imagery (HSI)) prevents researchers from taking advantage of this recent success. Since generating sensor specific datasets is time intensive and cost prohibitive, remote sensing researche...
متن کاملDeep Neural Networks for Semantic Segmentation of Multispectral Remote Sensing Imagery
A semantic segmentation algorithm must assign a label to every pixel in an image. Recently, semantic segmentation of RGB imagery has advanced significantly due to deep learning. Because creating datasets for semantic segmentation is laborious, these datasets tend to be significantly smaller than object recognition datasets. This makes it difficult to directly train a deep neural network for sem...
متن کاملLow-Shot Learning for the Semantic Segmentation of Remote Sensing Imagery
Recent advances in computer vision using deep learning with RGB imagery (e.g., object recognition and detection) have been made possible thanks to the development of large annotated RGB image datasets. In contrast, multispectral image (MSI) and hyperspectral image (HSI) datasets contain far fewer labeled images, in part due to the wide variety of sensors used. These annotations are especially l...
متن کاملUnderstanding Convolution for Semantic Segmentation
Recent advances in deep learning, especially deep convolutional neural networks (CNNs), have led to significant improvement over previous semantic segmentation systems. Here we show how to improve pixel-wise semantic segmentation by manipulating convolution-related operations that are of both theoretical and practical value. First, we design dense upsampling convolution (DUC) to generate pixel-...
متن کاملRemote Sensing and Land Use Extraction for Kernel Functions Analysis by Support Vector Machines with ASTER Multispectral Imagery
Land use is being considered as an element in determining land change studies, environmental planning and natural resource applications. The Earth’s surface Study by remote sensing has many benefits such as, continuous acquisition of data, broad regional coverage, cost effective data, map accurate data, and large archives of historical data. To study land use / cover, remote sensing as an effic...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Remote Sensing
سال: 2022
ISSN: ['2315-4632', '2315-4675']
DOI: https://doi.org/10.3390/rs14164065